19:29
2026-06-10
arstechnica.com
artificial-intelligence
Google DeepMind releases DiffusionGemma, a model that runs local AI 4x faster
Google DeepMind released DiffusionGemma, a new AI model that generates text in parallel blocks rather than sequentially, achieving up to 700 tokens per second on an RTX 5090 GPU and over 1,000 tokens โฆ